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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Petkovich, P. Martin, White, Jay A., 

Beckett, Barbara R., Jones, Glenville 

(ii) TITLE OF INVENTION: Retinoid Metabolizing Protein 



(iii) NUMBER OF SEQUENCES: 35 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Blake, Cassels & Graydon 

(B) STREET: Box 25, Commerce Court West 

(C) CITY: Toronto 

(D) PROVINCE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: M5L 1A9 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3 1/2 inch, 1 . 4 Mb storage 

(B) COMPUTER: COMPAQ, IBM PC compatible 

(C) OPERATING SYSTEM: MS-DOS 5.1 

(D) SOFTWARE: WORD PERFECT 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBERS: 08/667,546/ 08/724,466 

(B) FILING DATE: June 21, 1996; October 1, 1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Hunt, John C. 

(B) REGISTRATION NUMBER: 36,424 

(C) REFERENCE /DOCKET NUMBER: 50767/00010 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 863-4344 

(B) TELEFAX: (416) 863-2653 



(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 



TGCCAGTGGA CAATCTCCCT AC CAAAT T C A CTAGTTATGT CCAGAAATTA GCCTAAACCG 60 

GAGCCTTTGT ACATATGTTT TTATTTTAGA TGAACTGTGA TGTATTGGAT ATTTTCTAAT 120 

TTGTTTATAT AAAGCAGATG TGTATATAAG TCTATGCGAA GAAGC GAAAA CGAGGGCACT 180 

ACTTTCTCAT GGATCACTGT AATGCTACAG AGTGTCTGTG AT GT ATATTT ATAATGTAGT 240 

T GT GT CAT AT AGCTTTTGTA CTGTATGCAA CTTATTTAAC TCGCTCTTTA T CT CAT GGGT 300 

TTTATTTAAT AAAACAT GTT CTTACAAAAA AAAAAAA 337 



(2) INFORMATION FOR SEQ ID NO: 2 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 

Met Gly Leu Tyr Thr Leu Met Val Thr Phe Leu Cys Thr lie Val Leu 
15 10 15 

Pro Val Leu Leu Phe Leu Ala Ala Val Lys Leu Trp Glu Met Leu Met 

20 25 30 

lie Arg Arg Val Asp Pro Asn Cys Arg Ser Pro Leu Pro Pro Gly Thr 
35 40 45 

Met Gly Leu Pro Phe lie Gly Glu Thr Leu Gin Leu lie Leu Gin Arg 
50 55 60 

Arg Lys Phe Leu Arg Met Lys Arg Gin Lys Tyr Gly Cys lie Tyr Lys 
65 70 75 80 

Thr His Leu Phe Gly Asn Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Gin lie Leu Leu Gly Glu His Lys Leu Val Ser Val Gin Trp 

100 105 110 

Pro Ala Ser Val Arg Thr lie Leu Gly Ser Asp Thr Leu Ser Asn Val 
115 120 125 

His Gly Val Gin His Lys Asn Lys Lys Lys Ala lie Met Arg Ala Phe 
130 135 140 

Ser Arg Asp Ala Leu Glu His Tyr lie Pro Val lie Gin Gin Glu Val 
145 150 155 160 

Lys Ser Ala lie Gin Glu Trp Leu Gin Lys Asp Ser Cys Val Leu Val 

165 170 175 

Tyr Pro Glu Met Lys Lys Leu Met Phe Arg lie Ala Met Arg lie Leu 

180 185 190 

Leu Gly Phe Glu Pro Glu Gin lie Lys Thr Asp Glu Gin Glu Leu Val 
195 200 205 

Glu Ala Phe Glu Glu Met lie Lys Asn Leu Phe Ser Leu Pro lie Asp 
210 215 220 

Val Pro Phe Ser Gly Leu Tyr Arg Gly Leu Arg Ala Arg Asn Phe lie 
225 230 235 240 

His Ser Lys lie Glu Glu Asn lie Arg Lys Lys lie Gin Asp Asp Asp 

245 250 255 

Asn Glu Asn Glu Gin Lys Tyr Lys Asp Ala Leu Gin Leu Leu lie Glu 

260 265 270 

Asn Ser Arg Arg Ser Asp Glu Pro Phe Ser Leu Gin Ala Met Lys Glu 
275 280 285 

Ala Ala Thr Glu Leu Leu Phe Gly Gly His Glu Thr Thr Ala Ser Thr 
290 295 300 



Ala Thr Ser Leu Val Met Phe Leu Gly Leu Asn Thr Glu Val Val Gin 
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305 310 315 320 

Lys Val Arg Glu Glu Val Gin Glu Lys Val Glu Met Gly Met Tyr Thr 

325 330 335 

Pro Gly Lys Gly Leu Ser Met Glu Leu Leu Asp Gin Leu Lys Tyr Thr 

340 345 350 

Gly Cys Val lie Lys Glu Thr Leu Arg lie Asn Pro Pro Val Pro Gly 
355 360 365 

Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn Gly Tyr Gin lie 
370 375 380 

Pro Lys Gly Trp Asn Val lie Tyr Ser lie Cys Asp Thr His Asp Val 
385 390 395 400 

Ala Asp Val Phe Pro Asn Lys Glu Glu Phe Gin Pro Glu Arg Phe Met 

405 410 415 

Ser Lys Gly Leu Glu Asp Gly Ser Arg Phe Asn Tyr lie Pro Phe Gly 

420 425 430 

Gly Gly Ser Arg Met Cys Val Gly Lys Glu Phe Ala Lys Val Leu Leu 
435 440 445 

Lys lie Phe Leu Val Glu Leu Thr Gin His Cys Asn Trp lie Leu Ser 
450 455 460 

Asn Gly Pro Pro Thr Met Lys Thr Gly Pro Thr lie Tyr Pro Val Asp 
465 470 475 480 

Asn Leu Pro Thr Lys Phe Thr Ser Tyr Val Arg Asn 

485 490 

(2) INFORMATION FOR SEQ ID NO; 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1850 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

TGTCGCCGTT GCTGTCGGTT GCTGTCGGAC GCTGTCTCCT CTCCAGAAGC TTGTTTTTCG 60 

TTTTGGCGAT CAGTTGCGCG CTTCAAC ATG GGG CTG TAC ACC CTT ATG GTC ACC 114 

Met Gly Leu Tyr Thr Leu Met Val Thr 
1 5 

TTT CTC TGC ACC ATC GTG CTA CCC GTT TTA CTC TTT CTC GCC GCG GTG 162 
Phe Leu Cys Thr lie Val Leu Pro Val Leu Leu Phe Leu Ala Ala Val 
10 15 20 25 

AAG TTG TGG GAG ATG TTA ATG ATC CGA CGA GTC GAT CCG AAC TGC AGA 210 
Lys Leu Trp Glu Met Leu Met lie Arg Arg Val Asp Pro Asn Cys Arg 

30 35 40 

AGT CCT CTA CCG CCA GGT ACC ATG GGC TTG CCG TTC ATT GGA GAA ACG 2 58 

Ser Pro Leu Pro Pro Gly Thr Met Gly Leu Pro Phe lie Gly Glu Thr 

45 50 55 

CTC CAG CTG ATC CTC CAG AGA AGG AAG TTT CTG CGC ATG AAA CGG CAG 3 06 

Leu Gin Leu lie Leu Gin Arg Arg Lys Phe Leu Arg Met Lys Arg Gin 
60 65 70 
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AAA TAC GGG TGC ATC TAG AAG ACG CAC CTC TTC GGG AAC CCG ACT GTC 
Lys Tyr Gly Cys lie Tyr Lys Thr His Leu Phe Gly Asn Pro Thr Val 
75 80 85 



354 



AGG GTG ATG GGA GOT GAT AAT GTG AGG CAG ATT CTG CTG GGC GAA CAC 
Arg Val Met Gly Ala Asp Asn Val Arg Gin lie Leu Leu Gly Glu His 
90 95 100 105 



402 



AAG CTG GTG TCT GTT CAG TGG CCA GCA TCA GTG AGA ACC ATC CTG GGC 
Lys Leu Val Ser Val Gin Trp Pro Ala Ser Val Arg Thr lie Leu Gly 

110 115 120 



450 



TCT GAC ACC CTC TCC AAT GTC CAT GGA GTT CAA CAC AAA AAC AAG AAA 
Ser Asp Thr Leu Ser Asn Val His Gly Val Gin His Lys Asn Lys Lys 

125 130 135 



498 



AAG GCC ATT ATG AGG GCG TTC TCT CGA GAT GCT CTG GAG CAC TAC ATT 
Lys Ala lie Met Arg Ala Phe Ser Arg Asp Ala Leu Glu His Tyr lie 
140 145 150 



546 



CCC GTG ATC CAG CAG GAG GTG AAG AGC GCC ATA CAG GAA TGG CTG CAA 
Pro Val lie Gin Gin Glu Val Lys Ser Ala lie Gin Glu Trp Leu Gin 
155 160 165 



594 



AAA GAC TCC TGC GTG CTG GTT TAT CCA GAA ATG AAG AAA CTC ATG TTT 
Lys Asp Ser Cys Val Leu Val Tyr Pro Glu Met Lys Lys Leu Met Phe 
170 175 180 185 



642 



CGG ATA GCT ATG AGA ATC CTG CTT GGT TTT GAA CCA GAG CAA ATA AAG 
Arg lie Ala Met Arg lie Leu Leu Gly Phe Glu Pro Glu Gin lie Lys 

190 195 200 



690 



ACG GAC GAG CAA GAA CTG GTG GAA GCT TTT GAG GAA ATG ATC AAA AAC 
Thr Asp Glu Gin Glu Leu Val Glu Ala Phe Glu Glu Met lie Lys Asn 

205 210 215 



738 



TTG TTC TCC TTG CCA ATC GAC GTT CCT TTC AGT GGT CTG TAC AGG GGT 
Leu Phe Ser Leu Pro lie Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly 
220 225 230 



786 



TTG AGG GCA CGC AAT TTC ATT CAC TCC AAA ATT GAG GAA AAC ATC AGG 
Leu Arg Ala Arg Asn Phe lie His Ser Lys lie Glu Glu Asn lie Arg 
235 240 245 



834 



AAG AAA ATT CAA GAT GAC GAC AAT GAA AAC GAA CAG AAA TAC AAA GAC 
Lys Lys lie Gin Asp Asp Asp Asn Glu Asn Glu gin Lys Tyr Lys Asp 
250 255 260 265 



882 



GCC CTT CAG CTG TTG ATC GAG AAC AGC AGA AGA AGT GAC GAA CCT TTT 
Ala Leu Gin Leu Leu lie Glu Asn Ser Arg Arg Ser Asp Glu Pro Phe 

270 275 280 



930 



AGT TTG CAG GCG ATG AAA GAA GCA GCT ACA GAG CTT CTA TTT GGA GGT 
Ser Leu Gin Ala Met Lys Glu Ala Ala Thr Glu Leu Leu Phe Gly Gly 

285 290 295 



978 



CAT GAA ACC ACC GCC AGC ACT GCA ACC TCA CTT GTC ATG TTT CTG GGT 
His Glu Thr Thr Ala Ser Thr Ala Thr Ser Leu Val Met Phe Leu Gly 
300 305 310 



1026 



CTG AAC ACA GAA GTG GTG CAG AAG GTC AGA GAG GAG GTT CAG GAG AAG 
Leu Asn Thr Glu Val Val Gin Lys Val Arg Glu Glu Val Gin Glu Lys 
315 320 325 



1074 



1 
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GTT GAA ATG GGC ATG TAT ACA CCT GGA AAG GGC TTG AGT ATG GAG CTG 1122 
Val Glu Met Gly Met Tyr Thr Pro Gly Lys Gly Leu Ser Met Glu Leu 
330 335 340 345 

TTG GAC CAG CTG AAG TAC ACT GGA TGT GTG ATT AAA GAG ACT CTT AGA 117 0 

Leu Asp Gin Leu Lys Tyr Thr Gly Cys Val lie Lys Glu Thr Leu Arg 

350 355 360 

ATC AAC CCT CCT GTT CCC GGA GGA TTC AGA GTC GCA CTC AAA ACC TTT 1218 
lie Asn Pro Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe 

365 370 375 

GAA TTG AAT GGT TAC CAA ATT CCT AAA GGA TGG AAC GTC ATT TAC AGC 1266 
Glu Leu Asn Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser 
380 385 390 

ATC TGT GAC ACG CAC GAT GTG GCC GAC GTC TTT CCA AAC AAA GAG GAG 1314 
lie Cys Asp Thr His Asp Val Ala Asp Val Phe Pro Asn Lys Glu Glu 
395 400 405 

TTC CAG CCG GAG AGA TTC ATG AGC AAA GGT CTG GAG GAC GGG TCC AGG 1362 
Phe Gin Pro Glu Arg Phe Met Ser Lys Gly Leu Glu Asp Gly Ser Arg 
410 415 420 425 

TTT AAC TAC ATC CCC TTC GGA GGA GGA TCC AGG ATG TGT GTG GGC AAA 1410 
Phe Asn Tyr lie Pro Phe Gly Gly Gly Ser Arg Met Cys Val Gly Lys 

430 435 440 

GAG TTC GCC AAA GTG TTA CTC AAG ATC TTT TTA GTT GAG TTA ACG CAG 1458 
Glu Phe Ala Lys Val Leu Leu Lys lie Phe Leu Val Glu Leu Thr Gin 

445 450 455 

CAT TGC AAT TGG ATT CTC TCA AAC GGA CCC CCG ACA ATG AAA ACA GGC 1506 
His Cys Asn Trp lie Leu Ser Asn Gly Pro Pro Thr Met Lys Thr Gly 
460 465 470 

CCG ACT ATT TAC CCA GTG GAC AAT CTC CCT ACC AAA TTC ACT AGT TAT 1554 
Pro Thr lie Tyr Pro Val Asp Asn Leu Pro Thr Lys Phe Thr Ser Tyr 
475 480 485 

GTC AGA AAT TAGCCTAACC GGAGCTTTGT AC AT AT GT T T T TAT T TT AGA 1603 

Val Arg Asn 

490 

TGAACTGTGA TGTATTGGAT ATTTTCTATT TTGTTTATAT AAAGCAGATG T GT AT AT AAG 1663 

TCTATGCGAG GAAGCGAAAA CGAGGGCACT ACTTTCTCAT GGATCACTGT AAT GCTACAG 1723 

AGTGTCTGTG ATGTATATTT ATAATGTAGT TGTGTTATAT AGCTTTTGTA CTGTATGCAA 1783 

CTTATTTAAC TCGCTCTTTA T CT CAT GGGT TTTATTTAAT AAAACAT GTT CTTACAAAAA 1843 

AAAAAAA 1850 



i 



(2) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
15 10 15 
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Pro Leu Leu Leu Phe Leu Ala Ala lie Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

Val Ser Gly Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 

Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 

Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe lie Tyr Lys 
65 70 75 80 

Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Arg lie Leu Leu Gly Asp Asp Arg Leu Val Ser Val His Trp 

100 105 110 

Pro Ala Ser Val Arg Thr lie Leu Gly Ser Gly Cys Leu Ser Asn Leu 
115 120 125 

His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Arg Ala Phe 
130 135 140 

Ser Arg Glu Ala Leu Glu Cys Tyr Val Pro Val He Thr Glu Glu Val 
145 150 155 160 

Gly Ser Ser Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 170 175 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg He Ala Met Arg He 

180 185 190 

Leu Leu Gly Cys Glu Pro Gin Leu Ala Gly Asp Gly Asp Ser Glu Gin 
195 200 205 

Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

Pro He Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly Met Lys Ala Arg 
225 230 235 240 

Asn Leu He His Ala Arg He Glu Gin Asn He Arg Ala Lys He Cys 

245 250 255 

Gly Leu Arg Ala Ser Glu Ala Gly Gin Gly Cys Lys Asp Ala Leu Gin 

260 265 270 

Leu Leu He Glu His Ser Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 
275 280 285 

Ala Leu Lys Gin Ser Ser Thr Glu Leu Leu Phe Gly Gly His Glu Thr 
290 295 300 

Thr Ala Ser Ala Ala Thr Ser Leu He Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

His Val Leu Gin Lys Val Arg Glu Glu Leu Lys Ser Lys Gly Leu Leu 

325 330 335 

Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu He Leu Glu Gin 

340 345 350 



Leu Lys Tyr He Gly Cys Val He Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 
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Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 

Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser lie Cys Asp 
385 390 395 400 

Thr His Asp Val Ala Glu lie Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 

Asp Arg Phe Ser Ala Pro His Pro Glu Asp Ala Ser Arg Phe Ser Phe 

420 425 430 

lie Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 

Lys lie Leu Leu Lys lie Phe Thr Val Glu Leu Ala Arg His Cys Asp 
450 455 460 

Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 

Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr His Phe His Gly Glu 

485 490 495 

lie 



(2) INFORMATION FOR SEQ ID NO : 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 



ATG GGG CTC CCG GCG CTG CTG GCC AGT GCG CTC TGC ACC TTC GTG CTG 
Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
15 10 15 



48 



CCG CTG CTG CTC TTC CTG GCT GCG ATC AAG CTC TGG GAC CTG TAC TGC 
Pro Leu Leu Leu Phe Leu Ala Ala lie Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 



96 



GTG AGC GGC CGC GAC CGC AGT TGT GCC CTC CCA TTG CCC CCC GGG ACT 
Val Ser Gly Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 



144 



ATG GGC TTC CCC TTC TTT GGG GAA ACC TTG CAG ATG GTA CTG CAG CGG 
Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 



192 



AGG AAG TTC CTG CAG ATG AAG CGC AGG AAA TAC GGC TTC ATC TAC AAG 
Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe lie Tyr Lys 
65 70 75 80 



240 



ACG CAT CTG TTC GGG CGG CCC ACC GTA CGG GTG ATG GGC GCG GAC AAT 
Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 



288 



GTG CGG CGC ATC TTG CTC GGA GAC GAC CGG CTG GTG TCG GTC CAC TGG 
Val Arg Arg lie Leu Leu Gly Asp Asp Arg Leu Val Ser Val His Trp 

100 105 110 



336 
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CCA GCG TCG GTG CGC ACC ATT CTG GGA TCT GGC TGC CTC TCT AAC CTG 384 

Pro Ala Ser Val Arg Thr lie Leu Gly Ser Gly Cys Leu Ser Asn Leu 
115 120 125 

CAC GAC TCC TCG CAC AAG CAG CGC AAG AAG GTG ATT ATG CGG GCC TTC 432 
His Asp Ser Ser His Lys Gin Arg Lys Lys Val lie Met Arg Ala Phe 
130 135 140 

AGC CGC GAG GCA CTC GAA TGC TAC GTG CCG GTG ATC ACC GAG GAA GTG 48 0 

Ser Arg Glu Ala Leu Glu Cys Tyr Val Pro Val lie Thr Glu Glu Val 
145 150 155 160 

GGC AGC AGC CTG GAG CAG TGG CTG AGC TGC GGC GAG CGC GGC CTC CTG 528 
Gly Ser Ser Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 170 175 

GTC TAC CCC GAG GTG AAG CGC CTC ATG TTC CGA ATC GCC ATG CGC ATC 57 6 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg lie Ala Met Arg lie 

180 185 190 

CTA CTG GGC TGC GAA CCC CAA CTG GCG GGC GAC GGG GAC TCC GAG CAG 624 
Leu Leu Gly Cys Glu Pro Gin Leu Ala Gly Asp Gly Asp Ser Glu Gin 
195 200 205 

CAG CTT GTG GAG GCC TTC GAG GAA ATG ACC CGC AAT CTC TTC TCG CTG 672 
Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

CCC ATC GAC GTG CCC TTC AGC GGG CTG TAC CGG GGC ATG AAG GCG CGG 720 
Pro lie Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly Met Lys Ala Arg 
225 230 235 240 

AAC CTC ATT CAC GCG CGC ATC GAG CAG AAC ATT CGC GCC AAG ATC TGC 7 68 

Asn Leu lie His Ala Arg lie Glu Gin Asn lie Arg Ala Lys lie Cys 

245 250 255 

GGG CTG CGG GCA TCC GAG GCG GGC CAG GGC TGC AAA GAC GCG CTG CAG 816 
Gly Leu Arg Ala Ser Glu Ala Gly Gin Gly Cys Lys Asp Ala Leu Gin 

260 265 270 

CTG TTG ATC GAG CAC TCG TGG GAG AGG GGA GAG CGG CTG GAC ATG CAG 8 64 

Leu Leu lie Glu His Ser Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 
275 280 285 

GCA CTA AAG CAA TCT TCA ACC GAA CTC CTC TTT GGA GGA CAC GAA ACC 912 
Ala Leu Lys Gin Ser Ser Thr Glu Leu Leu Phe Gly Gly His Glu Thr 
290 295 300 

ACG GCC AGT GCA GCC ACA TCT CTG ATC ACT TAC CTG GGG CTC TAC CCA 960 
Thr Ala Ser Ala Ala Thr Ser Leu lie Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

CAT GTT CTC CAG AAA GTG CGA GAA GAG CTG AAG AGT AAG GGT TTA CTT 1008 
His Val Leu Gin Lys Val Arg Glu Glu Leu Lys Ser Lys Gly Leu Leu 

325 330 335 

TGC AAG AGC AAT CAA GAC AAC AAG TTG GAC ATG GAA ATT TTG GAA CAA 1056 
Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu lie Leu Glu Gin 

340 345 350 

CTT AAA TAC ATC GGG TGT GTT ATT AAG GAG ACC CTT CGA CTG AAT CCC 1104 
Leu Lys Tyr lie Gly Cys Val lie Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 
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CCA GTT CCA GGA GGG TTT CGG GTT GCT CTG AAG ACT TTT GAA TTA AAT 
Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 



1152 



GGA TAC CAG ATT CCC AAG GGC TGG AAT GTT ATC TAC AGT ATC TGT GAT 
Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser lie Cys Asp 
385 390 395 400 



1200 



ACT CAT GAT GTG GCA GAG ATC TTC ACC AAC AAG GAA GAA TTT AAT CCT 
Thr His Asp Val Ala Glu lie Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 



1248 



GAC CGA TTC AGT GCT CCT CAC CCA GAG GAT GCA TCC AGG TTC AGC TTC 
Asp Arg Phe Ser Ala Pro His Pro Glu Asp Ala Ser Arg Phe Ser Phe 

420 425 430 



1296 



ATT CCA TTT GGA GGA GGC CTT AGG AGC TGT GTA GGC AAA GAA TTT GCA 
lie Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 



1344 



AAA ATT CTT CTC AAA ATA TTT ACA GTG GAG CTG GCC AGG CAT TGT GAC 
Lys lie Leu Leu Lys lie Phe Thr Val Glu Leu Ala Arg His Cys Asp 
450 455 460 

TGG CAG CTT CTA AAT GGA CCT CCT ACA ATG AAA ACC AGT CCC ACC GTG 
Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 



1392 



1440 



TAT CCT GTG GAC AAT CTC CCT GCA AGA TTC ACC CAT TTC CAT GGG GAA 
Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr His Phe His Gly Glu 

485 490 495 



1488 



ATC TGA 
lie 



1494 



(2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Pro Phe Gly Gly Gly Pro Arg Leu Cys Pro Gly Tyr Glu Leu Ala Arg 
15 10 15 

Val Ala Leu Ser 

20 



(2) INFORMATION FOR SEQ ID NO : 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 

Pro Phe Ser Gly Gly Ala Arg Asn Cys lie Gly Lys Gin Phe Ala Met 
1 5 10 15 



Ser Glu Met Lys 

20 
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(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Pro Phe Ser Gly Gly Ala Arg Asn Cys lie Gly Lys Gin Phe Ala Met 
1 5 10 15 

Asn Glu Leu Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

Pro Phe Gly Thr Gly Pro Arg Asn Cys lie Gly Met Arg Phe Ala lie 
15 10 15 

Met Asn Met Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Pro Phe Ser Gly Gly Ser Arg Asn Cys lie Gly Lys Gin Phe Ala Met 
15 10 15 

Asn Glu Leu Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GAACTCCTCT TTGGAGGACA CGAAACCACG GCCAGTGCAG CCACATCTCT GATCACTTAC 
CTGGGGCTCT AC C C AC AT GT TCTCCAGAAA GTGCGAGAAG AGCTGAAGAG TAAGGGTTTA 
CTTTGCAAGA GCAAT CAAGA CAACAAGTTG GACAT GGAAA TTTTGGAACA ACTTAAATAC 
ATCGGGTGTG TTAT TAAGGA GACCCTTCGA CTGAATCCCC CAGTTCCAGG AGGGTTTCGG 
GTTGCTCTGA AGACTTTTGA ATTAAAT GGA T AC CAGATT C C CAAGGGCT G GAATGTTATC 
TACAGTATCT GTGATACTCA TGATGTGGCA GAGATCTTCA CCAACAAGGA A 



60 
120 
180 
240 
300 
351 
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(2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 
TTTTTTTTTT TTGG 



(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 

TTTTTTTTTT TTGA 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 
TTTTTTTTTT TTGT 

(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 

TTTTTTTTTT TTGC 



(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 

TTTTTTTTTT TTAG 



(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 

TTTTTTTTTT TTAA 



* 
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(2). INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



TTTTTTTTTT TT^^T 



14 



(2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY : linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



TTTTTTTTTT TTAC 



14 



(2) INFORMATION FOR SEQ ID NO: 20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
TTTTTTTTTT TTCG 



14 



(2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 



TTTTTTTTTT TTCA 



14 



(2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 



TTTTTTTTTT TTCT 



14 



(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



TTTTTTTTTT TTCC 



14 
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(2) INFORMATION FOR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
AAGCGACCGA 



(2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

TGTTCGCCAG 



(2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 

TGCCAGTGGA 



(2) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
GGCTGCAAAC 



(2) INFORMATION FOR SEQ ID NO: 28 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

CCTAGCGTTG 



(2) INFORMATION FOR SEQ ID NO: 29 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
GTAGCGGCCG CTGCCAGTGG A 
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(2) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

GTAGCGGCCG CT 12 

(2) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 . 

GCAC GAGGGA GGCTGAAGCG TGCC ATG GGG CTC CCG GCG CTG CTG GCC AGT 51 

Met Gly Leu Pro Ala Leu Leu Ala Ser 
1 5 

GCG CTC TGC ACC TTC GTG CTG CCG CTG CTG CTC TTC CTG GCG GCG CTC 99 
Ala Leu Cys Thr Phe Val Leu Pro Leu Leu Leu Phe Leu Ala Ala Leu 
10 15 20 25 

AAG CTC TGG GAC CTG TAC TGT GTG AGC AGC CGC GAT CGC AGC TGC GCC 147 
Lys Leu Trp Asp Leu Tyr Cys Val Ser Ser Arg Asp Arg Ser Cys Ala 

30 35 40 

CTC CCC TTG CCC CCC GGT ACC ATG GGC TTC CCA TTC TTT GGG GAA ACA 195 
Leu Pro Leu Pro Pro Gly Thr Met Gly Phe Pro Phe Phe Gly Glu Thr 

45 50 55 

TTG CAG ATG GTG CTT CAG CGG AGG AAG TTT CTG CAG ATG AAG CGC AGG 243 
Leu Gin Met Val Leu Gin Arg Arg Lys Phe Leu Gin Met Lys Arg Arg 
60 65 70 

AAA TAC GGC TTC ATC TAC AAG ACG CAT CTG TTT GGG CGG CCC ACG GTG 2 91 

Lys Tyr Gly Phe lie Tyr Lys Thr His Leu Phe Gly Arg Pro Thr Val 
75 80 85 

CGG GTG ATG GGC GCG GAT AAT GTG CGG CGC ATC TTG CTG GGA GAG CAC 339 
Arg Val Met Gly Ala Asp Asn Val Arg Arg lie Leu Leu Gly Glu His 
90 95 100 105 

CGG TTG GTG TCG GTG CAC TGG CCC GCG TCG GTG CGC ACC ATC CTG GGC 387 
Arg Leu Val Ser Val His Trp Pro Ala Ser Val Arg Thr lie Leu Gly 

110 115 120 

GCT GGC TGC CTC TCC AAC CTG CAC GAT TCC TCG CAC AAG CAG CGA AAG 435 
Ala Gly Cys Leu Ser Asn Leu His Asp Ser Ser His Lys Gin Arg Lys 

125 130 135 

AAG GTG ATT ATG CAG GCC TTC AGC CGC GAG GCA CTC CAG TGC TAC GTG 4 83 

Lys Val lie Met Gin Ala Phe Ser Arg Glu Ala Leu Gin Cys Tyr Val 
140 145 150 

CTC GTG ATC GCT GAG GAA GTC AGC AGT TGT CTG GAG CAG TGG CTA AGC 531 
Leu Val lie Ala Glu Glu Val Ser Ser Cys Leu Glu Gin Trp Leu Ser 
155 160 165 
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TGC GGC GAG CGC GGC CTC CTG GTC TAC CCC GAG GTG AAG CGC CTC ATG 
Cys Gly Glu Arg Gly Leu Leu Val Tyr Pro Glu Val Lys Arg Leu Met 
170 175 180 185 

TTC CGC ATC GCC ATG CGC ATC CTG CTG GGC TGC GAG CCG GGT CCA GCG 
Phe Arg lie Ala Met Arg lie Leu Leu Gly Cys Glu Pro Gly Pro Ala 

190 195 200 

GGC GGC GGG GAG GAC GAG CAA CAG CTC GTG GAG GCT TTC GAG GAG ATG 
Gly Gly Gly Glu Asp Glu Gin Gin Leu Val Glu Ala Phe Glu Glu Met 

205 210 215 

ACC CGC AAT CTC TTC TCT CTT CCC ATT GAC GTG CCC TTT AGC GGC CTG 
Thr Arg Asn Leu Phe Ser Leu Pro lie Asp Val Pro Phe Ser Gly Leu 
220 225 230 

TAC CGG GGC GTG AAG GCG CGG AAC CTT ATA CAC GCG CGC ATC GAG GAG 
Tyr Arg Gly Val Lys Ala Arg Asn Leu lie His Ala Arg lie Glu Glu 
235 240 245 

AAC ATT CGC GCC AAG ATC CGC CGG CTT CAG GCT ACA GAG CCG GAT GGG 
Asn lie Arg Ala Lys He Arg Arg Leu Gin Ala Thr Glu Pro Asp Gly 
250 255 260 265 

GGT TGC AAG GAC GCG CTG CAG CTC CTG ATT GAG CAC TCG TGG GAG AGG 
Gly Cys Lys Asp Ala Leu Gin Leu Leu He Glu His Ser Trp Glu Arg 

270 275 280 

GGA GAG AGG CTG GAT ATG CAG GCA CTA AAA CAA TCG TCA ACA GAG CTC 
Gly Glu Arg Leu Asp Met Gin Ala Leu Lys Gin Ser Ser Thr Glu Leu 

285 290 295 

CTC TTT GGT GGT CAT GAA ACT ACA GCC AGT GCT GCG ACA TCA CTG ATC 
Leu Phe Gly Gly His Glu Thr Thr Ala Ser Ala Ala Thr Ser Leu He 
300 305 310 

ACT TAC CTA GGA CTC TAC CCA CAT GTC CTC CAG AAA GTT CGA GAA GAG 
Thr Tyr Leu Gly Leu Tyr Pro His Val Leu Gin Lys Val Arg Glu Glu 
315 320 325 

ATA AAG AGC AAG GGC TTA CTT TGC AAG AGC AAT CAA GAC AAC AAG TTA 
He Lys Ser Lys Gly Leu Leu Cys Lys Ser Asn Gin Asp Asn Lys Leu 
330 335 340 345 

GAC ATG GAA ACT TTG GAA CAG CTT AAA TAC ATT GGG TGT GTC ATT AAG 
Asp Met Glu Thr Leu Glu Gin Leu Lys Tyr He Gly Cys Val He Lys 

350 355 360 

GAG ACC CTG CGA TTG AAT CCT CCG GTT CCA GGA GGG TTT CGG GTT GCT 
Glu Thr Leu Arg Leu Asn Pro Pro Val Pro Gly Gly Phe Arg Val Ala 

365 370 375 

CTG AAG ACT TTT GAG CTG AAT GGA TAC CAG ATC CCC AAG GGC TGG AAT 
Leu Lys Thr Phe Glu Leu Asn Gly Tyr Gin He Pro Lys Gly Trp Asn 
380 385 390 

GTT ATT TAC AGT ATC TGT GAC ACC CAC GAT GTG GCA GAT ATC TTC ACT 
Val He Tyr Ser He Cys Asp Thr His Asp Val Ala Asp He Phe Thr 
395 400 405 

AAC AAG GAG GAA TTT AAT CCC GAC CGC TTT ATA GTG CCT CAT CCA GAG 
Asn Lys Glu Glu Phe Asn Pro Asp Arg Phe He Val Pro His Pro Glu 
410 415 420 425 



t 



« i 



3 
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GAT GCT TCC CGG TTC AGC TTC ATT CCA TTT GGA GGA GGC CTT CGG AGC 1347 
Asp Ala Ser Arg Phe Ser Phe lie Pro Phe Gly Gly Gly Leu Arg Ser 

430 435 440 

TGT GTA GGC AAA GAG TTT GCA AAA ATT CTT CTT AAG ATA TTT ACA GTG 1395 
Cys Val Gly Lys Glu Phe Ala Lys lie Leu Leu Lys lie Phe Thr Val 

445 450 455 

GAG CTG GCT AGG CAC TGT GAT TGG CAG CTT CTA AAT GGA CCT CCT ACA 1443 
Glu Leu Ala Arg His Cys Asp Trp Gin Leu Leu Asn Gly Pro Pro Thr 
460 465 470 

ATG AAG ACA AGC CCC ACT GTG TAC CCT GTG GAC AAT CTC CCT GCA AGA 1491 
Met Lys Thr Ser Pro Thr Val Tyr Pro Val Asp Asn Leu Pro Ala Arg 
475 480 485 

TTC ACC TAC TTC CAG GGA GAT ATC T GAT AGCTAT TTCAATTCTT 1535 
Phe Thr Tyr Phe Gin Gly Asp lie 
490 495 

GGACTTATTT GAAGT GT AT A TTGGTTTTTT T T AAAAAT AG TGTCATGTTG ACTTTATTTA 1595 

ATTTCTAAAT GTATAGTATG AT ATT TAT GT GTCTCTACTA CAGTCCCGTG GT CTTTAAAT 1655 

I Ql ATTAAAATAA TGAATTTGTA TGATTTCCCA ATAAAGTAAA ATTAAAAAGT GAAAAAAAAA 1715 

\ ]J AAAAAAAAAA 1725 

1 UJ (2) INFORMATION FOR SEQ ID NO: 32 
;» (i) SEQUENCE CHARACTERISTICS: 

\ O (A) LENGTH: 497 amino acids 

.* I (B) TYPE: amino acid 

{ 51 (D) TOPOLOGY: linear 

MJf (D) TOPOLOGY: linear 

O (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

; ~ Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 

1 5 10 15 

* Pro Leu Leu Leu Phe Leu Ala Ala Leu Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

," ] Val Ser Ser Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 

[ 35 40 45 

. I Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 

j 50 55 60 

* : Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe He Tyr Lys 
i 65 70 75 80 

Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Arg He Leu Leu Gly Glu His Arg Leu Val Ser Val His Trp 
j 100 105 110 

« i Pro Ala Ser Val Arg Thr He Leu Gly Ala Gly Cys Leu Ser Asn Leu 

115 120 125 



i 



His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Gin Ala Phe 
130 135 140 



1 
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Ser Arg Glu Ala 
145 

Ser Ser Cys Leu 



Val Tyr Pro Glu 

180 

Leu Leu Gly Cys 
195 

Gin Leu Val Glu 
210 

Pro lie Asp Val 
225 

Asn Leu lie His 



Arg Leu Gin Ala 

260 

Leu Leu lie Glu 
275 

Ala Leu Lys Gin 
290 

Thr Ala Ser Ala 
305 

His Val Leu Gin 



Cys Lys Ser Asn 

340 

Leu Lys Tyr lie 
355 

Pro Val Pro Gly 
370 

Gly Tyr Gin lie 
385 

Thr His Asp Val 



Asp Arg Phe lie 

420 

lie Pro Phe Gly 
435 

Lys lie Leu Leu 
450 



Leu Gin Cys Tyr 
150 

Glu Gin Trp Leu 
165 

Val Lys Arg Leu 



Glu Pro Gly Pro 

200 

Ala Phe Glu Glu 
215 

Pro Phe Ser Gly 
230 

Ala Arg lie Glu 
245 

Thr Glu Pro Asp 



His Ser Trp Glu 

280 

Ser Ser Thr Glu 
295 

Ala Thr Ser Leu 
310 

Lys Val Arg Glu 
325 

Gin Asp Asn Lys 



Gly Cys Val lie 

360 

Gly Phe Arg Val 
375 

Pro Lys Gly Trp 
390 

Ala Asp lie Phe 
405 

Val Pro His Pro 



Gly Gly Leu Arg 

440 

Lys lie Phe Thr 
455 



Val Leu Val lie 
155 

Ser Cys Gly Glu 
170 

Met Phe Arg lie 
185 

Ala Gly Gly Gly 



Met Thr Arg Asn 

220 

Leu Tyr Arg Gly 
235 

Glu Asn lie Arg 
250 

Gly Gly Cys Lys 
265 

Arg Gly Glu Arg 



Leu Leu Phe Gly 

300 

lie Thr Tyr Leu 
315 

Glu lie Lys Ser 
330 

Leu Asp Met Glu 
345 

Lys Glu Thr Leu 



Ala Leu Lys Thr 

380 

Asn Val lie Tyr 
395 

Thr Asn Lys Glu 
410 

Glu Asp Ala Ser 
425 

Ser Cys Val Gly 



Val Glu Leu Ala 

460 



Ala Glu Glu Val 

160 

Arg Gly Leu Leu 
175 

Ala Met Arg lie 
190 

Glu Asp Glu Gin 
205 

Leu Phe Ser Leu 



Val Lys Ala Arg 

240 

Ala Lys lie Arg 
255 

Asp Ala Leu Gin 
270 

Leu Asp Met Gin 
285 

Gly His Glu Thr 



Gly Leu Tyr Pro 

320 

Lys Gly Leu Leu 
335 

Thr Leu Glu Gin 
350 

Arg Leu Asn Pro 
365 

Phe Glu Leu Asn 



Ser lie Cys Asp 

400 

Glu Phe Asn Pro 
415 

Arg Phe Ser Phe 
430 

Lys Glu Phe Ala 
445 

Arg His Cys Asp 



Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 
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Tyr Pro Val Asp Ash Leu Pro Ala Arg Phe Thr Tyr Phe Gin Gly Asp 

485 490 495 

lie 



(2) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CGCACCCCAG GAGGCGCGCT CGGAGGGAAG CCGCCACCGC CGCCGCCTCT GCCTCGGCGC 

GGAACAAACG GTTAAAGATT TTGGGCCASC GCCTCCGCGG GGGGAGGAGC CAGGGGCCCC 

AATCCCGCAA T T AAAGAT GA ACTTTGGGTG AACTAATTGT CTGACCAAGG TAACGTGGGC 

AGCAACCTGG GCCGCCTATA AAGC GGC AGC GCCGTGGGGT TTGAAGCGCT GGCGGCGGCG 

GCAGGT GGC G CGGGAGGTCG CGGCGCGCCA TGG 



(2) INFORMATION FOR SEQ ID NO: 34 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

CGCACCCCCA GGAGGCGCGC TCAGAGGGAA GCCGCCAGTG CGCCGCCTCT GCCTCGGCGC 

GGAACAAACG GTTAAAGATT TTTTTGGGCA GCGCCTCGAG GGGGGAGGAG CCAGGGGCCC 

GATCCGCAAT T AAAGAT GAA CTTTGGGTGA ACTAATTTGT CTGACCAAGG TAACGTGGGC 

AGTAACCTGG GCGGCCTTAT AAAGAGGGCG C GC GGC GGGG TTCGGAGCTA GGGAGGCGGC 

GGCAGGTGGC GCGGGAGGCT GAAGCGTGCC ATGG 



(2) INFORMATION FOR SEQ ID NO: 35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

TCGGGGGAAT TAACAC CTTT TCAAAGT GAA ATCTCAGGAT TGTCTGCCTT CTACAGGAGG 

TGGTATTAAA ATGCGCCTAT AACAAATGGT TGAGAGTTTG GAGCC GCTTC TGCCCTGTGG 

GCGGGGCGAG AT GACAC CAC AATTAAAGAT GAACTTTGGG TGAACTAATT TATCTGAGGA 

AGTTAACAGG AGGAGACCTG CGCGCAATGG ATATATAAGG GCGCGCAGGC GAGGACGCCC 

TCAGTTTGTG CGTAAAGACG CGTCTCCTCT CCAGAAGCTT GTTTTTCGTT TTGGCGATCA 



GTTGCGCGCT TCAACATGG 
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(2) INFORMATION FOR SEQ ID NO: 36 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

GATCCCAGATCTGCCTATTGCGCCCGATGCCCCGAGGCTCTCTCTTGGAC 
TCTGGCCCTGAGTTCTTCTGCGCGATCCTTCGGAGACGTCTGGAGGCCTG 
CT T TAT GCATCT CT CT T GGAC CT CAGT TTCC C CACAC GT GGGAGGAGGCA 
GCT GGAC GAT T C CT GAAAGGACT TT CC CTT GCT T C CT CATCAC GT GGAAG 
AGAGCC CACCC GGCACCTGGAAATGGAAAGC CAGT GAAGGCT GCTT T GGG 
C C GGGGC AKC GGGT GGGAC C GGGC GGGAGGGAT TC CAAAGAGAC C GC C GG 
GAAGGCTAGAGCT T GGAATT C C GGCT C CT C GGAGT C CT GGC C CT C C C C CA 
CC GCCGCCT CGGAGCT CAGCACACCTT GGAT GGGGGAGGCGGGCAGCT CC 
T AGC CC C GC ACCC CAGGAGGC GCGCT C GGAGGGAAGC C GC CAC C GC C GC C 
GCCTCTGCCTCGGCGCGGAACAAACGGTTAAAGATTTT GGGC CAS CGCCT 
C C GCGGGGGGAGGAGC CAGGGGCCCCAATC C C GCAATTAAAGATGAACTT 
T GGGT GAAC T AAT T GT C T GAC CAAGGTAAC GT GGGCAGC AAC C T GGGC C G 
CCTATAAAGCGGCAGCGCCGTGGGGTTTGAAGCGCTGGCGGCGGCGGCAG 
GTGGCGCGGGAGGTCGCGGCGCGCCATGGGGCTCCCGGCGCTGCTGGCCA 
GTGCGCTCTGCACCTTCGTGCTGCCGCTGCTGCTCTTCCTGGCTGCGATC 
AAGCT CT GGGACCT GT ACT GC GT GAGCGGCC GC GAC C GCAGT T GT GC C CT 
CCCATTGCCCCCCGGGACTATSGGSTTCCCCTTCTTTGGGGAAACCTTGC 
AGAT GNTACTNCAGGTAAGGGAGGGT GGGGC GGGAC AGGC T GC T T C C C C G 
GAGC C C GGCGCGGCT CT GGGCT T CT GCT GAAGT C GGGGT AGGC GC C C C CG 
GGAGGCATGCTATT GC GGCT AGGAGCAGGGCT GGC GGGAGC GC GGC GCT C 
CCCGGMKYMCSCTCAWGCSCRCWWKTMWCCTCCGCCTYMCTCCCAMAGCG 
GAR S AARWKC Y KGMR GAT GAAGC GC AGGAAAT AC GGC T T CAT C T ACAAGA 
CGCATCTGTTCGGGCGGCCCACCGTACGGGTGATGGGCGCGGACAATGTG 
CGGCGCATCTTGCTCGGAGAGCACCGGCTGGTGTCGGTCCACTGGCCAGC 
GT C GGT GC GCAC CAT T CT GGGAT CT GGCT GCCT CT CTAAC CT GCAC GACT 
C CT CGCACAAGCAGC GCAAGAAGGT GGGGGCAGGAGGCGAC GGCT GGACA 
GGGAGGGGGACCCCATTTAT GAGCGGAATT CCGGCT GAT GGAT GCT AGGC 
GCGGGCTAGCAGCTTGAGGTGGGCTAGGACCCTCTGCCAGCTCCAGGTTA 
GCTTTC C CAGCT CGGAGAGT GCCAT GT GT CT GGCAGGACT GGGGGT GTCT 
GGAAGGGGAC GGC GGTAGAC GAGAGGGGC GGAT G GAGGC T T T T AAC G C T G 
TCCCCTCCTCGGGACTCAGGTGATTATGCGGGCCTTCAGCCGCGAGGCAC 
T C GAAT GCTACGT GC C GGT GAT CAC C GAGGAAGT GGGC AGC AGC CT GGAG 
CAGT GGCTGAGCT GC GGCGAGC GC GGC CT C CT GGT CT AC C C C GAGGT GAA 
GCGCCTCATGTTCCGAATCGCCATGCGCATCCTACTGGGCTGCGAACCCC 
AACT GGC GGGC GAC GGGGACT C CGAGCAGCAGCTT GT GGAGGC CTT C GAG 
GAAATGACCCGCAATCTCTTCTCGCTGCCCATCGACGTGCCCTTCAGCGG 
GCTGTACCGGGTAAGGGCGGCAAACGGGCTGCGGACTAGGGGCGC GGGAC 
CTGGGCGTCTGCTCACCGCCGCGCGCTCTCTGCGCTCAGGGCATGAAGGC 
GC GGAAC CT CAT T CAC GC GC GC AT C GAGCAGAAC AT T C GC GC CAAGAT CT 
GCGGGCTGCGGGCATCCGAGGCGGGCCAGGGCTGCAAAGACGCGCTGCAG 
CT GTT GAT CGAGCACT CGT GGGAGAGGGGAGAGC GGCT GGAC AT GCAGGT 
GAGTAGCAGCTT CAGACCAGGCACT GCGGAGTTT GGT CC C C T GGC TT TC C 
AAGGCGCTGTTCCTGGGGCCCCCAAAGCGCGCGCCTGGGGCCCAGCTTTC 
T GGAGT GGGC GGCCGGCT C AGACT ACAGCT AT GGAAT C C C GAAGGAAGGC 
T GAGACACCCGGTCAGGAGAGCTGCGGAAGGGGCTGCGGMGGAAACT GGG 
AGCAT C C CCTAGCCTTTAMCAGGTTT CAAAGGGAAAGTT GGAAT T T GCAA 
AAAT GTTAATAAAGAACCTT GCGATTTTAATAAAACTAAGACTT TAACT C 
AGGAGTTTCCGGTAGRGCGGGGTCGTACTCGCCTTACTGCTCCAGCTGAA 
CTAAAGGGAC GTT GCAT TTT GTT TAAAGATATT GC T T T C C T T GAC T T T C T 
GTCAGCAAAACATTTAGCC CTTCTAGT CTT CCCT C CAGAACT CT CAGTT C 
GATT CTGAGTAATCCTTCTGT CAAACCGCAGGCAGACTT GTGAGAATGT G 
GGTCTCACTCTATTCTTAGGCACTAAAGCAATCTTCAACCGAACTCCTCT 
TT GGAGGACACGAAAC CAC GGCCAGT GCAGCCACAT CT CT GATCACT T AC 
CTGGGGCTCTACCCACATGTTCTCCAG 



(2) INFORMATION FOR . SEQ ID NO:37 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GATCCAGGTTGCT GAAACATAT CTCCATATAGGGCAGAACAATTAT CAAA 
AGCATAAGAATTGCAGCCACAGCATAGGGAAGAAAGAGGAGTTTTTAAAC 
CACAACAAAAGGGAGAAAGAAGAGAAT T TTAACT T ACATT T AAT T CAAAA 
GT CTTCAGAGCAAC CC GAAAC CCTCCTGGAACT GGGGGATT CAGT CGAAG 
GGT CT C CTTAATAACACAC CCGAT GTAT YTAAGT T GTT C CAAAAT T T C CA 
T GT CCAACT TGTT GT CT T GATT GCT CTT GCAAAGTAAAC CCTAY CAAAAY 
AGT CAT ACAGAGGT GAACAGT YATTTTGT GCT CCAATTAAAAT CAGCCCA 
GCAGAC GTAAACAGGGCT TAAGT GGAGACT AAAC CCAAAGGGC C C CAT GA 
T GGGAGAGACT GGGAGGGGGAAAC AGCAGCTAAT GGCCAT T T GC CT GC C C 
AAAT C CACTAT CT ATTT ACAAT CC CAGGAGAAT GCTGCT CAC CAGTTAGA 
AGGACCAAGTTTCTCCCCACGCCCCCCCACCCCACACTCACCACCACCAC 
C CACACTAAT CAGCTAT T CACACT ATGTAT GC C CT T GGACACAC CAAT T C 
AAGAAAAGT GGAAC CTAT CTGAGAAT CT CCAC GGTT CACAAAAAGGT GGA 
GGAGGGGTAGGAATACAAGGT CAAACCCTGCCC 



(2) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

TCGCGAGGAGCGACCACGGCTT GAAGAGGGGTAGACGAGACCAGAT GCT C 
CCCGQC GCCCCCTCATGCGGGTTGCGGTCTCTCTCCTCCACCTCCCTCTC 
AGCGGAGGAAGTTTCTGCAGATGAAGCGCAGGAAATAC GGCTT CAT CTAC 
AAGACGCATCTGTTTGGGCGGCCCACGGTGCGGGTGATGGGCGCGGATAA 
T GT GC GGC GCAT CTT GCT GGGAGAGCAC C GGTT GGT GT C GGT GCACT GGC 
CCGCGTCGGTGCGCACCATCCTGGGCGCTGGCTGCCTCTCCAACCTGCAC 
GATTC CT C GCACAAGCAGC GAAAGAAGGT GAGGGT GAGCT GGCAACT C CT 
T GGCT GGCAGGGAGACCT CAT CCT AT GGCT T GGT T CAGGC AAAAT AGAAT 
GC GGGGC GAGGGCTAGT C CTAT GT GGT GGGGAC CAGGAC C CT C T CTAT CT 
GAGATCCACTTTAGCTTTTCTGCTAGCACGTGGGTTAGTCCTGGGGGGGA 
CTGAAATTCTTGAAAGGGTACTCGGAAAGGCGAAGGGGGGGGGGCTGAGG 
GAAAGT AGAGGATT GTAACACT CT CT GCT C CT GGGGGGT GCT CAGGT GAT 
TAT GCAGGC CTT CAGC C GCGAGGCACT C CAGT GCTAC GT GC C C GT GAT C G 
CT GAGGAAGT CAGCAGTT GTCT GGAGCAGT GGCTAAGCT GC GGC GAGC GC 
GGCCTCCTGGTCTACCCCGAGGTGAAGCGCCTCATGTTCCGCATCGCCAT 
GCGCAT C CT GCT GGGCT GC GAGC C GGGT C CAGC GGGC GGC GGGGAGGAC G 
AGCAGCAGCT C GT GGAGGCTTT C GAGGAGATGACC CGCAAT CT C TTCTC T 
CT T CC CAT TGAC GTGC CCTTTAGC GGC CT GT AC C GGGT AAGGGC GGTT T G 
C GGAGT C GGAGTAGGGGAACGCAAGCT C GGGCAT CCGCT CAC C GC CAC GC 
T CT CTCCGCGCT CAGGGCGTGAAGGCGCGGAACCTTATACAC GC GCGCAT 
C GAGGAGAACATT CGC GC CAAGAT CCGCC GGCT T CAGGCT AC AGAGC C GG 
ATGGGGGTTGCAAGGACGCGCT GCAGCT CCT GATT GAGC ACT C GT GGGAG 
AGGGGAGAGAGGCTGGATATGCAGGTGAGAAGCAATTTCAAAAGGTGCCA 
AGGGCCGGGGAGT GCCTCT GACTTTCCAGACACACTTT CTGGGGT CT CCA 
AAGCC CT GT CAAGGCCC CAGC TACT T C CAAGT GGGC GGC GAT GCT AGGT C 
TAGAGCTTTTCAACCTGTGGGTCGTGACCCCTTCACGGAGCCAAACAACC 
CTTTCAGAAGGGTCGCCTAAGAGCAT CTGCATAT CCGAT ATTT AC AT CAA 
GAAACATAACAGTAGCAAAATTACCGT TAT GAAGTAGCAACAAAGATAAT 
TT TAT CGTTGGGGGT CACCACAACAC GAGGAAC C GTATT AAAGGGT GGCA 
TTGGTCTAGAGAGCT GTGGAAGGGGGTGGCTGAGCAAT GGGGAAGAT C C C 
AAAGTT CAAAGGGCAAGGCTCAT CTACAAAGGTTAAAGC GGAAGAGCAGG 
ATTAAGGGAGTTTTGCGTTTTTGTTTGTGGTCTTTGACTTTCTATGAACA 
AAACGGATTTTACCCTTGAAGTCTTCCGTGCAATATTCTCAGGTCAGGTC 
T TT GTAACAGT GCTATAAACT GCACT CAGAT CT GTATAAACT T C C GTT TT 
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TATCCTTAGGCACTAAAACAATCGTCAACAGAGCTCCTCTTTGGTGGTCA 
T GAAACTACAGC CAGT GCTGCGAC GTCACT GATCACT TACC TAGGAC T CT 
ACCCACATGTCCTCCAGAAAGTTCGAGAAGAGATAAAGAGCAAGGTAGGA 
T GAT T C TAGAGGT T C C C CATTT GCCTAGGACAT T C CT CTAT TAAC CAC CA 
C CAC CACCCCCACTGTATATAAGTT T GCT CGATACAC CCAGTACT AT GAC 
AGT GAAGAT CT GAGAGCT AGGT GGGACT GT GGGGGAGAGAC T C CAC C T C G 
TGAATTTAAAAAGGCAGTT GTTT GTACT GGGCT CT CT CTT GGGCAGAATT 
TGACCCTCTCCTCCTCCTCCTCCTCCTCCTCCTCTTCCTCCTCCACCACC 
ACCACCATCACCACCTTTTATAGAGCAAGGTTCTCCTTTCCCTGACCAAG 
AAC AT GAAT AAT GT GAT T AGAGC C AAT AGC T GAT C AGGGT C GC AGT GT T G 
GT GAGGGCT CAGGGTAT GACCCTTTATATAC CT GATAAGCAACAT T GT CT 
GGATAAT GGGTTTAGGCT GAGGAAGTGTGGAAAGGAAGGCCATCAGGCCA 
TCAGCTCTTTCCCTTTTATCCTCTCCCATCCAGACGCCTTCAGGTTTAGT 
TAACAGGT GAGTC CTGCT GGGCTGACTTT TT TTTTGGAGTGC C CAGGGAT 
CCATCACTCACTTTTTTATCTGTTTCCATAGGGCTTACTTTGCAAGAGCA 
ATCAAGACAACAAGTTAGACATGGAAACTTTGGCACAGCTTAAATACACT 
GGGT GT GT CAT TAAGGAGAC C CT GC GAT T GAAT C C T C C GGT T C CAGGAGG 
GTTT C GGGTTGCT CTGAAGACTTTT GAGCT GAAT GT GAGT GCAC CT C CT G 
TCCCCCACCCCCAGCCCTCGTCCACGTCCACTCTGCTATGCTGTTGAGCA 
T CAGCT GCCCAGAGCAGT GGCTCACT GCCCTTGACAGTGTCCT GCCTCCT 
AT GGT ACT GGGAAC CAATTT GCT CT CCT CT CTTAAT GC CAT C CAT GCT AG 
TAAT GACT TTTT GT TGT T GCAAGCT CAGGGC C GGGAT T GT CAAT T CTT AG 
GATTTT T TT TTTT T T TTAAACAGGGAT ACCAGAT C C C CAAGGGCT GGAAT 
GTT ATT TACAGTAT CT GT GACACCCACGAT GT GGCAGATAT CTT CACTAA 
CAAGGAGGAATTTAATCCCGACCGCTTTATAGTGCCTCATCCAGAGGATG 
CTTCCCGGTTCAGCTTCATTCCATTTGGAGGAGGCCTTCGGAGCTGTGTA 
GGCAAAGAGTTTGCAAAAATTCTTCTTAAGATATTTACAGTGGAGCTGGC 
TAGGCACT GTGATT GGCAGC TTCTAAAT GGAC CT C C TACAAT GAAGACAA 
GCCCCACT GTGTAC CCT GT GGACAAT CT CC CT GCAAGAT TTAC C CACT T C 
CAGGGAGATATCT GATAGCTATTTCAATTCTT GGACTTATTT GAAGTGTA 
TAT T GT T T T T T T T AAAAT AGT GT CAT GT T GAC T T TAT T TAAT T T C T AAAT 
GT AT AGT AT GAT AT T TAT GT GT CT CT ACT ACAGT CC C GT GGT CT T AAAT A 
T T AAAAT AAT GAAT T T GT AT GAT T T C CC AAT AAAGT AAAAT T AAAAAGT G 
CTTCTCTTGCTTTTTAAGATTCTTGTTGGCAAGCTGCCCATGGTGGTACA 
TT GCT GTAATACT AGGACTT GGAAGGT GGAGGCAAGAAGAGC AGGCAT T C 
AAGGCTAGC CT GGGCT ACAGAAAT C CT GT CT T AAAC AAAC AC T AC AACAA 
AAAGT CCT GTTAGGGAAT CT GACT GGCT CAGT GTTT GTACTTT GT GTATT 
TAAAATGATTTAGAGTGAAAC CATAGGTCTCT C CCC CAT GTCAGAAAATA 
TATATTATTATGT GTAT GCT GATCCAAAGTAT CTTT GTAACTTTTTCTAA 
GGT CATTGAGACT T CATATT TTGAAATTGTAT GGAGGC TAGT TATATTAC 
ATTATTTATTTATTTATTTATTTACATTTTTATGGTGCTGGGGATTGGAT 
CGAAGGCTTCACACCTCTAGGGCAAGCCCTTTGTCATTAAGGCGCTGCCT 
CTCCCTTTCAGCCCAACGTTAATTCTAGATTCTTTTT CTTT GGT GCTTTT 
GGGAGGTAAACCT GGGAT GCT GCAGTT AT T T GGT GGT GGT C GT T GGT T T T 
ACT CTAGAGAGAAGGCAACTT TGGGAAGGCAACACT GCT GCT GGT GAGT C 
GGGAAGCAT CAT CCCAGAGCAACGGGGTCAGCATAGCTAACATTTTAAAT 
CAGCATAAT GAAT C CCT GT CAT AT GGAGGAGGCAGAAGT CCT CT T T GAAG 
T T GATATTT TAGATAAGACAGAGC CAGCC CCT CT GGT TAT GGACAGTT CT 
T AC C C AAAAT GAAACAGAGAAGAAAAC CACT GGT GT GT CAC C T T T C CT T A 
GAAGTGCTTCAGGA 



